A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Author
Abstract:
Environmental noise significantly degrades speech quality and impairs the performance of hearing aids, automatic speech recognition systems, and mobile phones. This paper considers single-channel enhancement of speech corrupted by additive noise. A dictionary-based algorithm is proposed that trains speech and noise models for each subband of the wavelet decomposition based on a coherence criterion. The presented learning method minimizes both the self-coherence between the atoms of each dictionary and the mutual coherence between the atoms of the speech and noise dictionaries, which yields a lower sparse reconstruction error. To reduce computation time, a composite dictionary is used that contains only the speech dictionary and the one noise dictionary selected to match the noise condition in the test environment. The speech enhancement algorithm is introduced in two scenarios, supervised and semi-supervised, and the enhancement schemes differ between them. In each scenario, a voice activity detector (VAD) is employed that is based on the energy of the sparse coefficient matrices obtained when the observed data is coded over the related dictionary. In the proposed supervised scenario, a domain adaptation technique transforms a learned noise dictionary into a dictionary adapted to the noise conditions of the test environment. With this step, the observed data is sparsely coded with a low sparse approximation error that reflects the current state of the noisy environment. This technique plays a prominent role in obtaining better enhancement results, particularly when the noise is non-stationary.
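The coherence measures and the energy-based VAD mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the standard definition of coherence as the largest absolute inner product between unit-norm atoms, and a simple fixed energy threshold for the VAD. The function names, the column-per-atom layout, and the `threshold` parameter are all hypothetical choices made for illustration.

```python
import numpy as np


def normalize_atoms(D):
    """Scale every column (atom) of D to unit Euclidean norm."""
    norms = np.linalg.norm(D, axis=0, keepdims=True)
    return D / np.maximum(norms, 1e-12)  # guard against all-zero atoms


def self_coherence(D):
    """Largest |<d_i, d_j>| over distinct atoms of one dictionary."""
    Dn = normalize_atoms(np.asarray(D, dtype=float))
    gram = np.abs(Dn.T @ Dn)
    np.fill_diagonal(gram, 0.0)  # ignore each atom's product with itself
    return float(gram.max())


def mutual_coherence(D_speech, D_noise):
    """Largest |<s_i, n_j>| between speech atoms and noise atoms."""
    S = normalize_atoms(np.asarray(D_speech, dtype=float))
    N = normalize_atoms(np.asarray(D_noise, dtype=float))
    return float(np.abs(S.T @ N).max())


def energy_vad(coeffs, threshold):
    """Flag frames whose sparse-coefficient energy exceeds a threshold.

    coeffs has shape (n_atoms, n_frames); the fixed threshold is an
    assumption, the paper may use a different decision rule.
    """
    energy = np.sum(np.asarray(coeffs, dtype=float) ** 2, axis=0)
    return energy > threshold


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    D_speech = rng.standard_normal((64, 128))  # stand-in dictionaries
    D_noise = rng.standard_normal((64, 128))
    print("self-coherence:", self_coherence(D_speech))
    print("mutual coherence:", mutual_coherence(D_speech, D_noise))
```

Lower values of both measures mean the atoms overlap less, which is the property the learning method is described as promoting.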
In the proposed semi-supervised scenario, adaptive thresholding of the wavelet coefficients is carried out based on the variance of the estimated noise for each frame in the different subbands. Both schemes are implemented under two conditions for the training and test steps: speaker-dependent and speaker-independent. Several measures are applied to evaluate the performance of the presented enhancement procedures, and a statistical test is used to compare the considered methods more precisely under the various noise conditions. The experimental results show that the presented supervised enhancement scheme leads to much better results than the baseline enhancement methods, learning-based approaches, and earlier wavelet-based algorithms. These results were obtained for an extensive range of noise types, including structured, unstructured, and periodic noise, at different SNR values.
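Frame-wise adaptive thresholding of the kind described for the semi-supervised scenario can be sketched as follows. The abstract does not give the threshold rule, so this sketch assumes soft thresholding with the universal threshold sigma * sqrt(2 ln N) computed per frame from a supplied noise variance. The function names, the `frame_len` parameter, and the choice of rule are illustrative assumptions, not the paper's method.

```python
import numpy as np


def soft_threshold(x, t):
    """Shrink coefficients toward zero by t; values with |x| <= t become 0."""
    x = np.asarray(x, dtype=float)
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)


def frame_adaptive_threshold(subband, noise_var, frame_len):
    """Threshold one subband frame by frame.

    subband   : 1-D array of wavelet coefficients for a single subband
    noise_var : estimated noise variance for each frame (assumed given)
    frame_len : number of coefficients per frame (hypothetical parameter)
    """
    out = np.array(subband, dtype=float)
    for i, var in enumerate(noise_var):
        start = i * frame_len
        if start >= out.size:
            break
        stop = min(start + frame_len, out.size)
        n = stop - start
        # Assumed rule: universal threshold sigma * sqrt(2 ln N) per frame.
        t = np.sqrt(var) * np.sqrt(2.0 * np.log(max(n, 2)))
        out[start:stop] = soft_threshold(out[start:stop], t)
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    coeffs = rng.standard_normal(32)
    coeffs[:4] += 8.0  # a few large "speech-like" coefficients
    cleaned = frame_adaptive_threshold(coeffs, noise_var=[1.0] * 4, frame_len=8)
    print(np.count_nonzero(cleaned), "coefficients survive thresholding")
```

Because the threshold scales with the per-frame noise estimate, frames judged noisier are shrunk more aggressively, which is the behavior one would want when the noise level changes over time.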
Similar resources
A New Method for Multisensor Data Fusion Based on Wavelet Transform in a Chemical Plant
This paper presents a new multi-sensor data fusion method based on the combination of wavelet transform (WT) and extended Kalman filter (EKF). Input data are first filtered by a wavelet transform via Daubechies wavelet “db4” functions and the filtered data are then fused based on variance weights in terms of minimum mean square error. The fused data are finally treated by extended Kalman filter...
Wavelet transform-based speech enhancement
This paper describes a speech enhancement system using a novel combination of a Fast Wavelet Transform structure, together with “Wiener filtering” in the wavelet domain. The specific application of interest is the enhancement of speech when a cellular phone is used within a moving vehicle. Subjective tests carried out using speech with additive vehicle noise at a signal-to-noise ratio of 10 dB ...
A Novel Image Denoising Method Based on Incoherent Dictionary Learning and Domain Adaptation Technique
In this paper, a new method for image denoising based on incoherent dictionary learning and a domain transfer technique is proposed. Sparse representation is one of the most active areas of interest for researchers. The goal of sparse coding is to approximately model the input data as a weighted linear combination of a small number of basis vectors. Two characteristics should b...
New Speech Enhancement Method based on Wavelet Transform and Tracking of Non Stationary Noise Algorithm
In this work, we have developed an efficient approach for enhancing speech by combining a non-stationary noise tracking algorithm with the Continuous Wavelet Transform (CWT). The non-stationary noise tracking method, which is based on data-driven recursive noise power estimation, was proposed by Jan S. Erkelens and Richard Heusdens. The continuous wavelet decomposition of the speech signal uses adaptive leve...
A New Approach for Speech Enhancement Based On Singular Value Decomposition and Wavelet Transform
In this paper, a new approach for speech enhancement is presented. The proposed algorithm is based on singular value decomposition (SVD) and the wavelet transform. A model of the contaminating noise is estimated using SVD, and this noise estimate then determines the thresholding value. Not needing silence frames to estimate the noise model is an advantage of sugg...
Journal title
Volume 17, Issue 3
Pages 17-36
Publication date: 2020-11